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HOW MANY TUPLES OF GROUP ELEMENTS HAVE A GIVEN PROPERTY? 

Anton A. Klyachko Anna A. Mkrtchyan 

Faculty of Mechanics and Mathematics, Moscow State University 
Moscow 119991, Leninskie gory, MSU 
klyachkofDmech.math.msu.su anna.mkr@gmail.com 

with an appendix by Dmitrii V. Trushin 

Generalising Solomon's theorem, C. Gordon and F. Rodriguez- Villegas have proven recently that, in any group, the number 
of solutions to a system of coefficient-free equations is divisible by the order of this group whenever the rank of the matrix 
composed of the exponent sums of ith unknown in jth equation is less than the number of unknowns. We generalise this result 
in two directions: first, we consider equations with coefficients, and secondly, we consider not only systems of equations but 
also any first-order formulae in the group language (with constants). Our theorem implies some amusing facts; for example, the 
' number of group elements whose squares lie in a given subgroup is divisible by the order this subgroup. 



0. Introduction 

P? 

r K . Solomon theorem [Solo69] . In any group, the number of solutions to a system of coefficient-free equations is divisible 
' by the order of this group if the number of equations is less than the number of unknowns. 

This topic was developed in different directions (see, e.g. [Stru95], [AmVll], [Isaa70], and references therein), but 
£h ! the simplest and most natural generalisation of this theorem was obtained quite recently. 

Gordon— Rodriguez- Villegas theorem [GRV12]. In any group, the number of solutions to a system of coefEcient- 
■ free equations is divisible by the order of this group if the rank of the matrix composed of exponent sums of ith 
| unknown in jth equation is less than the number of unknowns. 

For example, the Solomon theorem does not applies to the system x 2 y 3 [x, = 1 = (yx) 2 , but nevertheless, the 

number of its solutions in a group is divisible by the order of the group according to the Gordon-Rodriguez- Villegas 
" theorem, because the rank of the corresponding matrix is one while there are two unknowns. 

What if the rank of the matrix is much less than the number of unknowns? Does this imply that the number 
04 ' of solutions must be divisible by a higher power of the order of the group? The answer is no. And even a weaker 
I . conjecture is false. In Section 2, we give an example. 

We generalise the Gordon-Rodriguez- Villegas theorem in other directions. We study equations with coefficients, 
k> ■ and not only systems of equations but also any first-order formulae. The main theorem implies a lot of non-obvious 
; i \ facts, e.g. that mentioned in the abstract. Section 1 contains the main theorem. In Section 2, we give several examples. 
& • In Section 3, we prove the main theorem. In particular, our argument proves the Gordon-Rodriguez- Villegas theorem 
and is (in this case) somewhat simpler than the original one (from our point of view), although our proof is based on 
the same ideas. In Section 4, we give a direct proof of the amusing fact from the abstract. We have not succeeded in 
finding this fact in literature, though it might be easily obtained from a result of P. Hall (see Section 1) generalising 
the well-known Frobenius theorem [Frob03] (see also [Hall59]) which says that the number of solutions to the equation 
x n = g is divisible by GCD(n, \C(g)\). The Frobenius theorem was generalised in various directions (see, e.g. [Hall36], 
[Kula38], [Sehg62], [BrTh88], [AsTaOl], and references therein). 

The appendix written by D. V. Trushin contains a proof of a pure logical proposition which makes it possible to 
simplify somewhat the statement of a corollary of the main theorem at the expense of preliminarily transformation of 
the logic formula. 

Our notation is mainly standard. Note only that if k £ Z and x and y are elements of a group, then x v , x ky , and x~ v 
denote y~ Y xy, y~ 1 x k y and y~ 1 x~ 1 y, respectively. A commutator [x,y] is x~ 1 y~ 1 xy. If X is a subset of a group, then 
\X\, (X), and C(X) denote the cardinality of X, the subgroup generated by X, and the centraliser of X. The letter Z 
denotes the set of integers. 

The authors thank A. V. Vasilev and I. M. Isaacs for useful remarks. 
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1. Main theorem 

Let G be a group. We consider the group language L over the group G. This language has two functional symbols: 
• and _1 , and also, for each element of G, there is a constant symbol g. We do not assume that the group is finite 
(although it is in the majority of interesting cases) ; the assertions about divisibility should be understood in the sense 
of cardinal arithmetics: any infinite cardinal is divisible by any smaller or equal nonzero cardinal (and zero is divisible 
by any cardinal). 

Consider an arbitrary first-order formula <p in the language L. We assume that this formula is written in the form 

3q ip 

and each atomic subformula of ip has the form 

u = 1 or q~ uq = ft, 

where the words u E G * F (possibly, different for different subformulae) can contain free and bound variables and 
elements of G (called coefficients of tp) but cannot contain the variable q (this special variable occurs nowhere else 
in ip). Here, G * F is the free product of G and the free group F whose basis is the set of all variables of ip, except q. 
Atomic subformulae of the first form are called homogeneous equations of ip, atomic subformulae of the second form 
are called inhomogeneous equations of <p. The elements ft e G (they may also be different for different subformulae) 
are called free terms and are not considered as coefficients of the formula. Certainly, any formula ip is equivalent to 
a formula of the specified form, because we can transfer everything to the left-hand side in atomic subformulae and 
consider all equations as homogeneous; in this case, the special variable q docs not occur in ip at all. 

Now, we define the graph T(<p) of the formula ip as follows. The vertices of T(p) are all bound variables of ip. Each 
atomic subformula (homogeneous or inhomogeneous) containing bound variables has the form 

vi(yi)w 1 (x 1 ...,x n ). . .v r {y r )w r {x 1 ...,x n )=h, 

where yi are bound variables of the formula ip (not necessarily different), x± . . . ,x n are all (different) free variables 
of tp, the words Vi(yi) belong to the free product G* {yi) ^ of G and the infinite cyclic group generated by the letter yi, 
the words Wi(x\ . . . , x n ) belong to the free product G * F(xi . . . , x n ) of G and the free group with basis x\ . . . , x n , 
and h G G. Let us connect the vertices y% and yi+\ (subscripts modulo r) by a directed edge labelled by an integer 
tuple (a.\, ...,a n ), where ctj is the exponent sum of Xj in ivf, loops labelled by zero tuples are excluded. We apply 
this construction to each atomic subformula containing bound variables. 
For instance, if the formula <p(xi, x%) has the form 3q ip and ip is*) 

VyBz (([ygy, x 1 gx 2 ]x 1 zx^ x\hx 7 2 = l) A -. (z s (x 2 x 1 ) 2 = l) V {(x\x 2 2 f = l)) , (1) 
where g,h € G are some fixed elements, then the graph T(p) looks as follows:**' 

(-1,-1) (2,1) (2,2) 



(3,4) 





Fig. f 

Now, in the graph T(p), we choose cycles Ci, C2, . . . generating the first homology group (e.g. we can take generators 
of the fundamental groups of all components), and compose the matrix A(ip) of <p as follows: for each generating cycle Cj 
we write a row which is the sum of labels of the edges of this cycle, and then we add rows consisting of exponent sums 
of atomic subformulae containing no bound variables. 

This matrix depends on the choice of generating cycles but the integer linear hull of its rows is determined uniquely 
by the formula <p. For the example above, the matrix A(ip) is 



A(<p) 



( -l V 

5 5 

2 2 

V fO fO 



(for an obvious choice of three generating cycles). (2) 



*' We do not assume that the formula is always in the prenex normal form (i.e. all quantifiers are moved outside), 
as in this example. Another feature of this example is that ip does not depend on q; so, <p = tp. 
**) We omit the isolated vertex q. Isolated vertices play no role in this subject. 
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Theorem. If the rank of the matrix A(tp) of a formula tp is less than the number of free variables of this formula, 
then the number of tuples of group elements satisfying formula tp is divisible by the order of the centraliser of the set 
of all coefficients of p. In particular, this number is divisible by the order of the group if all coefficients equal 1. 

For the example above, the rank of the matrix is one and there are two free variables. Therefore, the theorem 
asserts that the cardinality of the set 

{{xi,x 2 ) G G 2 ; Vy3z (([ygy,x 1 gx 2 }xizx^ 3 x 3 1 hxl = l) A^(z 8 (x 2 x 1 ) 2 = l) V {{x\x%f = l))} 

is divisible by \C(g,h)\. In the following section, we give more instructive examples. 

Corollary 1. The number of solutions to a system of equations in a group is divisible by the order of the centraliser 
of the set of coefficients if the rank of the matrix of this system is less than the number of unknowns. 

This corollary transforms into the Gordon-Rodriguez- Villegas theorem in the case where all coefficients equal 1. 

Gordon Rodriguez- Villegas conjugation theorem ([GRV12], Corollary 3.5*'). 

Let {wj(x\, . . . , x n )} C F(x\, . . . , x n ) be a set of words (elements of a free group) such that the rank of the matrix 
composed of exponent sums of Xi in Wj is less than n. Then, for any group G and any elements hj G G, the number 
of tuples satisfying the formula 

3 

is divisible by the order of G. 

This statement (generalizing a conjugation theorem from [Solo69]) is obviously stronger than the Gordon- 
Rodriguez- Villegas theorem from the introduction. Our theorem gives even stronger fact: the pointwise conjugation is 
replaced by a common conjugation. 

Corollary 2. Under the condition of the Gordon-Rodriguez-Villegas conjugation theorem, the number of tuples 
satisfying the formula 

3q (w j (x 1 ,...,x n ) = q^hjq^j , 

is divisible by the order of G. 

(If Wj(xi, . . . , x n ) G G * F(xi, . . . , x n ), then the number of satisfying tuples is divisible by the order of the centraliser 
of the set of all coefficients of the words Wj.) 

Proof. The graph contains the only vertex q and the matrix of the formula coincides with the matrix from the 
Gordon-Rodriguez- Villegas conjugation theorem. So, Corollary 2 follows from the main theorem. 

The following statement is an analogue of the Solomon theorem (and transforms into it in the case when all 
coefficients equal 1, formula is quantifier- free and is a conjunction of equalities). 

Corollary 3. The number of tuples of group elements satisfying formula tp is divisible by the order of the centraliser 
of the set of all coefficients of tp (in particular, this number is divisible by the order of the group provided all coefficients 
equal 1 ) if, in the formula tp, 

(the number of proper occurrences of bound variables)+ 

+(the number of components of r(<^))+ (*) 
+(the number of atomic sub formulae containing no bound variables) < (the number of all variables). 

An occurrence of a variable y is a maximal subword in the left-hand side of an equation (the left-hand side is 
considered as a cyclic word) containing the variable y and no other variables; An occurrence is called proper if it does 
not coincide with the entire left-hand side of the equation. In the example above, there are two occurrences of y and 
two occurrences of z, that is four occurrences of bound variables; all these occurrences are proper. The number of 
occurrences of the special variable q is always equal to the number of inhomogeneous equations (because we consider 
left-hand sides of equations as cyclic words). 

Proof. The rank of the first homology group of a graph equals to the number of edges minus the number of vertices 
plus the number of connected components. The number of vertices equals to the number of bound variables, the 
number of edges equals to the number of occurrences of bound variables; improper occurrences give loops with zero 

*) translated into a language convenient for our purposes. 
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labels. Therefore, the rank of the matrix A(ip) is an most the left-hand side of (*) minus the number of bound variables. 
It remains to apply the theorem. 

It is interesting that actually the graph T(ip) can always be assumed to be connected, in the sense that the graph 
of a formula written economically, i.e. with minimal possible number of bound variables (among formulae equivalent to 
the given one) is always connected. For example, any formula with two bound variables that do not occur together in 
atomic subformulae is equivalent to a formula with one bound variable. Dima Trushin proves this fact in the appendix. 

Corollary 4. The number of group elements whose k-th powers belong to a given subgroup is divisible by the order 
of this subgroup. 

Proof. Let H be a subgroup of a group G. We are interested in elements x such that x k E H. First, suppose that 
the subgroup H is the centraliser of a subgroup A. Then the inclusion x E H is equivalent to the system of equations 
{ [x k , a] = 1 ; a E A} satisfying the main theorem (there are no bound variables and the matrix is zero) . Therefore, 
the number of solutions is divisible by the order of the centraliser of the set of coefficients, i.e. by \H\, as required. 

Now let H be an arbitrary subgroup. Let us use the following^ trick. We embed the group G in a larger group G 
in such a way that H becomes the centraliser of a subgroup A of G. In addition, we must guarantee that all solutions 
of our system of equations over G belong to G. For this sake, we make G to be the centraliser of another subgroup 
B C G and consider the system of equations 

(/\([x fe ,a] = l))A(/\([x,6] = l)). 

\aeA ) \beB / 

This will prove Corollary 4 in the general case. 

The role of G can be played by the amalgamated free product G = (B x G) ^ (H x A), where A and B are 

H 

arbitrary nontrivial centreless groups. Clearly, C(A) = H and C(B) = G, i.e. the solutions to our system of equations 
are precisely the elements of G whose fcth powers lie in H. According to the main theorem, the number of solutions 
is divisible by \C(A) n C(B)\ = \HnG\ = \H\, as required. 

This proof of Corollary 4 uses only a very particular case of the main theorem, where the formula ip is a system 
of equations with one unknown. This case of the main theorem follows immediately from an old result of P. Hall 
(generalising the Frobenius theorem). 

Hall theorem ([Hall36], Theorem II). In any group, the number of solutions to a system of equations with one 
unknown is divisible by GCD(|C|, m, n 2 , . . .), where C is the centraliser of the set of coefficients and ni is the exponent 
sum of the unknown in i-th equation. 

A trick (another) allowing to transform an arbitrary subgroup into a centraliser can also be found in [Hall36]. In 
Section 4, we give a direct proof of Corollary 4 illustrating a part of the proof of the main theorem. 

A more general fact (which also might be derived from a known result [Sehg62]) can be obtained similarly. 

Corollary 5. Suppose that H is a subgroup of a group G and W is a subgroup (or a subset) of a Gnitely generated 
free group F. Then the number of homomorphisms f:F^G such that f(W) C H is divisible by \H\. 

Proof. This number is the number of solutions to the system of equations 

A ([w, a] = 1) A ([x, b] = 1) J , where A is a basis of F, 

K aeA, w<=W J \beB xex J 

in the group G = (B x G) H< (H x A), where A and B are arbitrary nontrivial centreless groups. According to the 

H 

main theorem, the number of solutions is divisible by \C{A) f~1 C(B)\ = \H n G\ = \H\, as required. 
2. Examples 

We start with a curious application of the Solomon theorem. 

Example 1. We say that two elements of a group belong to the same tribe if their squares are equal. Clearly, the 
total size of all tribes is the order of the group. It is less obvious that 

the sum of 2013th powers of tribe sizes is a multiple of the order of the group. 
To prove this fact, it suffices to consider the system of equations x\ = ... = x\ Qli . Clearly, the number of solutions is 
the sum of 2013th powers of tribe sizes. The number of equations is less than that of unknowns. So, the statement is 
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a corollary of the Solomon theorem. The assertion remains valid if 2013 is replaced by an arbitrary positive integer; 
the squares (in the definition of tribes) can also be replaced by any (equal) positive integer powers. 

Example 2. The number of pairs of group elements whose product of squares is a cube is divisible by the order of the 
group. This follows from Corollary 2, because the formula 3z x 2 y 2 = z 3 has one bound variable, it occurs once, there 
are no equations without bound variables, the graph is connected, and there are two free variables: 1 + + 1 < 2 + 1. 
This fact can also be derived from the Gordon-Rodriguez- Villegas conjugation theorem. Indeed, this theorem implies 
that the order of the group divides the number of pairs of group elements whose product of squares is conjugate to 
any given element. 

By the same reason, the order of any group divides, e.g. the following numbers: 

- the number of pairs of noncommuting elements whose product of squares is the cube of a noncentral element; 

- the number of pairs of noncommuting elements whose product of squares is a cube if and only if the cube of their 
product lies in the centre; 

- the number of pairs of elements such that either the product of their squares is a cube or their commutator is not 
a square; 

Example 3. The order of a group divides the number of pairs of elements of this group whose product of squares 
is the cube of a commutator (x\x\ = [z,t] 3 ) and square of product is the commutator of cubes of the same elements 
((xix 2 ) 2 — [z 3 ,t 3 ]). It is difficult to derive this fact from the Gordon-Rodriguez- Villegas conjugation theorem, but 
it follows immediately from our main theorem. Corollary 2 gives a stronger statement: the order of a group divides 
the number of pairs of elements whose square of product and product of squares are simultaneously conjugate to any 
given pair of elements. 

The analogy between the Gordon-Rodriguez- Villegas theorem and well-known properties of solutions to systems 
of linear equations (over finite fields) could suggest an idea that, if the rank of the matrix is much less than the number 
of unknowns, then the number of solutions must be divisible by a higher power of the order of the group. A more 
realistic question is the following. 

Is it true that the number of homomorphisms from a finitely generated group H into a group G is divisible by 
\G\ m if H admits an cpimorphism onto a free group of rank ml 
The point is that the number of solutions to a system of coefficient-free equations 

{u(xi, ...,x n )= v(x 1 ,. ..,£„) = ... = 1} 

equals to the number of homomorphisms from the group H = (x\, . . . , x n \ u(x\, . . . , x n ) — v(x\, . . . , x n ) = . . . = 1) to 
the group G. The matrix of the system has rank at most r if and only if the group H admits an epimorphism onto 
the free abelian group of rank n — r. The existence of an epimorphism onto the absolutely free group of the same rank 
is a much stronger property, but nevertheless, the conjecture under consideration is false for m > 1, as the following 
example shows. 

Example 4. The group (x,y, z \ z = z x z v ) has an epimorphism onto the free group of rank two (sending z to 1), but 
the number of solutions to the equation z — z x z v in the symmetric group S3 is not divisible by IS3I 2 = 36. Indeed, 
with z = 1, there are 36 solutions (x and y can be arbitrary). With z = (123), there are 3-3 = 9 solutions (x and y are 
arbitrary transpositions). With z = (321), there are also 9 solutions. If z is a transposition, then there are no solutions 
(by parity). Thus, the total amount of solutions is 36 + 2 • 9. 

3. Proof of the main theorem 

Lemma 1. Under the conditions of the theorem, the first column of the matrix A(ip) vanishes after a suitable invertible 
change of free variables. In particular, the exponent sum of x\ (in new variables) in each cycle of the graph T(ip) is 
zero. 

Proof. The rank of the matrix A(ip) is less than the number of its columns; therefore, some integer (invertible) 
elementary transformations of columns produces a matrix with zero first column. Elementary transformations of 
columns are induced by obvious changes of variables, e.g. the change X 2, ^ X 1 X k , adds zth column multiplied by k to 
jth column. Lemma 1 is proven. 

For example, to annihilate the first column of matrix (2) from Section 1, it suffices to subtract the second column 
from the first one, i.e. the change of variables x 2 — > X2X^ X transform formula (1) into the formula 

Vy3z (([ygy,x 1 gx2X^ 1 }x 1 z(x2X^ 1 )- 3 x 3 1 h(x 2 x^ 1 ) 7 = l) A - (z 8 x 2 = l) V (x\{x 2 x^ff = l)) . (3) 

The graph of this formula is shown in Figure 2 
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(0,-1) (1,1) (0,2) 




Fig. 2 

and the matrix is 










5 
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In what follows, we assume that the first column of A(ip) is zero. 

For each variable t (free or bound), we introduce new symbols where igZ, (their meaning will be 
£(») = t x l = x^ l tx\, but we act formally at this step). Similarly, for each coefficient g, we introduce new symbols g^\ 
where i 6 Z. Now, we transform formula ip as follows: 

1) in all atomic subformulae, replace each symbol t by t^°\ where t is a variable different from x\ or a coefficient; 

2) replace each subword of the form (t^) k x[, where t is a variable or a coefficient, by x l 1 {t lyl+l ^) k ; 

3) repeat step 2) while this is possible. 

This shift of symbols x\ produces a formula without x\ (the exponent sum of x\ vanishes in each cycle of the graph 
and in each atomic subformula without bound variables and, hence, the exponent sum of x\ vanishes in all atomic 
subformulae). For example, formula (3) after these transformations will have the following atomic subformulae: 

[JO) a (0)„(o) (- 1 )a; ( - 1) l«(- 1 ) (x^x^x^A' 1 h^x^x^x^x^x^x^x^ -1 

yy y y , y x 2 1^2 J '2 2 I 2 2 2 2 2 x 2 2 ' 

(*(°)) 8 04°>) 2 = 1, W 
{x^x^f = 1. 



Now, we proceed with transformation of the formula. 

4) In each homogeneous equation a, we replace all symbols iW by t^ %+ ^"\ where the integers j a are chosen such 
that, for each bound variable t, the symbols iW will occur in the entire formula with at most one value of the 
superscript (i); this is possible, because the exponent sum of xi vanishes in each cycle of the graph. 

In formula (4), it suffices to decrease the superscripts by one in the second equation. In the general case, we can 
act as follows. In each connected component K of T, we choose a vertex (variable) yx, for the component Q 
containing the special variable q, we put uq = q. In each homogeneous equation a containing bound variables, we 
choose one of such variables y a and connect each vertex y a by a path p a with the vertex yx such that y a G K . 
The sum s a of the first coordinates of labels of edges of the path p a does not depend on the choice of the path by 
the condition. Put j a = —i a — s a , where i a is the unique number such that occurs in the equation a. The 
sum of the first coordinates of labels of edges vanishes in each cycle, hence, s a = sp if y a — yp, the value j a does 
not depend on the choice of variables y a in equation a, and the changes tW — > m each equation a produce 

a formula such that each bound variable t occurs in the entire formula only with one superscript (i), where i is 
the sum of the first coordinates of labels of edges of any path from to t. 

5) Now, we replace each quantifier Vy and 3y by My^ and 3y^ p \ where p £ Z is the unique number such that y^ 
occurs in atomic subformulae. 

6) Finally, we add equalities defining new symbols to the obtained formula <p, i.e. we replace <p by the infinite formula 




where t ranges over all free variables and all coefficients of the initial formula, i ranges over all integers, and g 
ranges over all coefficients. The symbols grW, where g G G, are considered as free variables of the formula ip' . 

The numbers of tuples satisfying the obtained formula <p' and the initial formula ip are equal. Indeed, the for- 
mula iff = (p A 5 admits as many satisfying tuples as the formula Tp = (p\ x t (i.e. the formula tp with each symbol , 
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where t is a free variable of the initial formula or a coefficient, is substituted by the expression xT~ l tx\). The formula Tp 
is equivalent to ip (i.e. this formulae are satisfied by the same tuples). Indeed, Tp differs from ip in two aspects: 

a) in quantifiers of Tp, symbols y^ occurs instead of y; 

b) in atomic subformulae of Tp, expressions (y^) Xl " occurs instead of bound variables y, where p is the same for all 
occurrences of y. 

Formulae different only in these aspects are obviously equivalent: e.g. (Vya(y, z, ...)) = (\/ta(t z ,z,.. .)), because for 
any g e G, if y ranges over the whole group, then y 9 ranges over the whole group. 

Thus, it suffices to show that the number of tuples satisfying <p' (such tuples are called solutions henceforth), is 
divisible by \C\, where the letter C denotes the centraliser of the set of coefficients of ip (or of <p', equivalently) . 

Consider a solution X = (xi, x[ j \ g^ i = 2, ...,n, j eZj. The tuple (x\ j \ g^^j, i.e. everything but x\, is 

called the tail of the solution X . Let Bx denote the centraliser of the tail of X. Note that Bx C C (because of the 
equations g^> — g in formula (**)). 

We say that two solutions are similar if their tails are conjugate by an element of C. Clearly, this is an equivalence 
relation. The theorem is a corollary of the following proposition. 

Proposition. Each class of similar solutions contains exactly \C\ solutions. 

Let us find the number of solutions similar to X. The number of all possible tails of such solutions is |C|/|i?x|, 
because, on the set of tails of solutions similar to X, the group C acts by conjugation (since a tuple conjugate to the 
tail of a solution Y by an element c e C is also the tail of a solution, e.g., of Y c ) and Bx is the stabiliser of the tail 
of X. 

The number of solutions with the same tail as that of X equals to \Bx\, since, if a tuple with the same tail and 
with the first coordinate x[ is also a solution, then the quotient x^ixx)^ 1 must commute with the tail because of the 
equations 5 in formula (**), i.e. x[ £ B X X\- On the other hand, any element x[ £ B X X\ gives a solution with the 
same tail (as that of X), because the variable X\ occurs in ip' only in subformula S. 

If X' is a solution similar to X, then the number of solutions with the same tail as that of X' equals to \B X > \ = \B X \ 
(if tails are conjugate, then their centralisers are conjugate and have the same order). 

Thus, we obtain that the number of solutions similar to X equals (|C|/|Bx|) • \Bx\ — \C\, that proves the 
proposition and the theorem. 

4. Roots of subgroups 

In Section 1, the following assertion was derived from the main theorem. 

Corollary 4. The number of group elements whose k-th powers belong to a given subgroup is divisible by the order 
of this subgroup. 

Here, we give a direct proof that exemplify the concluding part of the proof of the main theorem. For simplicity, 
we assume that k = 2. Let H be a subgroup of a group G. We are interested in elements x G G such that x 2 6 H; 
such elements are called solutions henceforth. The assertion is implied by the following lemma. 

Lemma. Each double coset HxH contains either or \H\ solutions. 

Proof. Let a; be a solution; its tail is the coset Hx. 

The group H acts (on the right) on the set of tails of solutions from the double coset HxH by the right multipli- 
cation: 

Hy oh = Hyh (= the tail of the solution y h ). 

The stabiliser of the tail Hx is B x = HC\H X : 

Hx = Hxh <^=> heH x . 

Therefore, all possible solutions lying in HxH have precisely \H\/\B X \ different tails. 
How many solutions have the same tail as that of xl 

Hx = Hy => yx^ 1 e H, 

but if y is also a solution, then 

(Hx)x = Hx 2 = H = Hy 2 = (Hx)y, i.e. yx- 1 e H x . 

Thus, each solution y with the same tail as that of x lies in B x x. On the other hand, each element from this coset is 
a solution: 

(bx) 2 — bxbx — bb x x 2 G H, because b, b x , and x 2 lie in H . 
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So, the number of solutions with the same tail as that of x is \B X \. 

Since \B X \ = \B y \ if x and y lie in the same double coset HxH (because B x and B y are conjugate in this case), 
HxH contains \B X \ ■ (\H\/\B X \) = \H\ solutions, that completes the proof. 

Recently, I. M. Isaacs [Isaal2] obtained a character-theoretic proof of this corollary. 

APPENDIX. ON THE MINIMISATION OF THE NUMBER OF BOUND VARIABLES IN FIRST-ORDER FORMULAE 

Dmitrii V. Trushin 

Einstein Institute of Mathematics, The Hebrew University of Jerusalem, 
Givat Ram, Jerusalem, 91904, Israel 
trushindima <Sya ndex.ru 

Let if be a hrst-order formula (in some language) with bound variables y\, ■ ■ ■ ,yk and free variables Xi, . . . , x m . 
Consider the following graph A(c/?) with vertices t/i, . . . , y^: vertices yi and yj are connected by an edge if there exists 
an atomic subformula in if containing the both variables. 

The following assertion shows that the formula ip with disconnected graph A(tp) is equivalent to a formula with 
fewer bound variables. 

Proposition. Any formula ip is equivalent to a formula ip' with connected graph A (</?') such that 

\A(ip')\ ^ max(|Ai|, . . . , |A S |), where Ai, . . . , A s are connected components of A(ip). 

Proof. Let $i be the set of formulae ip such that 

1) all variables of ip (free and bound) lie in the set {x\, . . . , x m } U {t/i, . . . , yk}; 

2) all bound variables of ip lie in the set {j/i, . . . , j/fe}; 

3) if a variable yj occurs in ip (as a bound or free variable), then yj G Aj. 
Note that the classes $i are closed under logical operators and quantifications on yj . 

Let A be the closure of the union 1J $i with respect to logical operators ( V, A, and -i) 

i 

from A can be written in the form 

i s 

ip = \/ f\ Tij, where nj G 
i=ij=i 

and the in form 

I s 

tp = f\ \f rij, where r t] e (C) 

i=i i=i 

because the conjunction and disjunction are mutually distributive, the classes $j are closed with respect to logical 
operations, -.(A VB) = (-.A) A (->B), and -.(A A B) = (-.A) V (-*B). 

Lemma. The ciass A is closed with respect to quantifications on the variables yj . 

Proof. Note that 



Clearly, each formula 
(D) 



Vy(Mv)*Mv)) = Wi(ll)) A(VyV>2(t/)) and Vy^i (y) V ip 2 ) = (Vyipi(y)) V r/> 2 , 

where, in the second equality, y is not a free variable of the formula ip2- For the existence quantifier we have similar 
equalities 

3y{tpi(y) Vtp2(y)) = (3V>i(y)) V (3yip 2 {y)) and 3y(%pi(y) A%p 2 ) = (3yipi(yj) A ip 2 , 

where, in the second equality, y is not a free variable of the formula ip 2 . 
Suppose that a formula tp G A is written in the form (D): 

ip = (m A ... A n s ) V ... V (r a A ... A ris) 

and a variable belongs to a component A t . Then 

3yj (rn A ... A r ls ) V ... V (r a A ... A r Js ) = {3 Vj (r n A ... A r ls )) V ... V {3y 3 (r n A ... A r; s )) = 

= (ru A ... A (3t/jr u ) A ... A r ls ) V ... V (r a A ... A (3j/j)r /t A ... A r ls ) , 

where the first equality is valid for any formulae and the second equality is valid, because, in ith. term of the disjunction, 
only r it may depend on yj. A similar transformation can be made for the universal quantifier using the form (C) of 
the formula tp € A. Lemma is proven. 
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Let us proceed with the proof of proposition. By condition, each atomic sub formula of ip belongs to a class <I>j, 
i.e., in particular, this subformula lies in A. Since A is closed with respect to logic operators and quantifications on yj 
(by Lemma), we obtain that ip £ A, i.e. 

I s 

p = V A r y 

i=ij=i 

Let us assume that the maximum of |Aj| is attained at Ai. Then, in each formula ry, where j ^ 1, we change the 
names of bound variables from Aj for the names of variables from Ai. Since Ai is the largest component, we can assign 
different names from Ai to different variables (in each particular formula ry). This renaming produces a formula if' 
equivalent to ip and all bound variables of p' belong to Ai. 
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